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AMENDMENTS TO THE CLAIMS 

Please add or amend the claims to read as follows, and cancel without prejudice or 
disclaimer to resubmission in a divisional or continuation application claims indicated as 
cancelled: 

1. (Canceled) 

2. (Currently amended) An automatic method of classifying molecules having similar 

biologic function comprising the steps of: 

a) automatically creating a hierarchical organization of the molecules in a 
database comprising the steps of: 

i calculating pairwise similarities between said molecules in said 
database by combining at least one standard measure of similarity, resulting in a 
first set of expectation values of similarity; 

ii analyzing said first set of expectation values of similarity so as to 
obtain a second set of expectation values of similarity, wherein said molecules of 
the second set have a high degree of similarity; 

iii merging said resulting molecules of step ii) so as to form clusters, 
wherein only molecules with an expectation value below a first restricted 
threshold are merged; 

iv identifying groups of clusters using focal consideration resulting in 
related clusters; 

v deterrrihiing the relationship between related dusters in said groups; 

vi analyzing said groups of the related clusters of step v), thereby 
creating a hierarchical organization of said molecules; 

b) automatically determining the position of a selected molecule in said 
database based on the hierarchical organization of step a, comprising the steps of: 
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i identifying pairwise similarities between said selected molecule to the 
molecules in said database by combining at least one standard measure of 
similarity, resulting in a third set of expectation values of similarity; 

ii identifying geometric averaging of said selected molecule to each of 
said resulting clusters in said hierarchy, resulting in a forth set of expectation 
values of similarity; 

iii identifying related clusters from said hierarchy of step a, having a 
geometric averaging with said selected molecule below a second threshold, 
thereby classifying molecules having similar biologic function. 

3. (Original) The method of claim 2, wherein each related cluster resulting from 
step iv is further analyzed, wherein if said geometric averaging of said related 
cluster to a second related cluster is below said first threshold a connection is 
established between two clusters. 

k (Original) The method of ciaim 3, wherein said cluster is analyzed by 
calculating the geometric averaging between molecules in one cluster to 
molecules in the other cluster. 

(Original)The method of claim 2, wherein the method of determining 
the relationship between said related clusters in step v further comprises the steps 
of applying a global test on said connected clusters for identifying nuclei of 
strong relationships within said groups of clusters, wherein each cluster is 
checked against its nearest cluster by using said geometric averaging, wherein 
if said geometric averaging is below said first threshold the two clusters 
merged. 

(Original) The method of claim 2, further comprises repeating steps iv, v 
while raising the threshold in a step wise manner until a third threshold is 
achieved. 
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7. (Original)The method according to claim 2, wherein said groups containing 
the clusters of every two molecules from different clusters with expectation 
value below the threshold. 

8. (Original) The method according to claim 2, wherein the method further 
comprises applying three standard measures of similarity. 

9. (Original) The method according to claim 8, wherein said standard measures of 
similarity are Smith Waterman (SW), FASTA, and BLAST. 

10. (Original) The method according to claim 9, further comprising the step of 
applying a numerical normalization to the SW, the FASTA and the BLAST. 

1 1 . (Original) The method according to claim 10, wherein the method step a) ii) is 
analyzed using thresholds 0.1, 0.1 and 10' 3 for SW, FASTA and BLAST 
respectively wherein a value of pairwise similarities between two protein 
sequences is maintained if either SW or FASTA yield an expectation value <0.1 or 
BLASTS expectation value is < 10" 3 . 

1 2. (Original) The method according to claim 1 1 , further comprising filtering the 
results of BLAST thereby excluding low complexity segments, using the SEG 
program; or setting a more stringent threshold for BLAST at 10" 6 if filtering 
decreased the number of related sequences by half. 

13. (Original) The method according to claim 12, further comprising mamtaining 
a value of an expectation value of similarity between two protein when all three 
methods yield an expectation value < 1. 

14. (Withdrawn) The method of claim 1 , wherein the molecules are nucleic acids. 

15. (Withdrawn) The method of claim 1, wherein the nucleic acids are DNA or 
RNA. 



16. 



(Withdrawn) The method of claim 1, wherein the molecules are 
pofynucleotides. 
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17. (Currently amended) The method of claim 4-2, wherein the molecules are 
proteins. 

18. (Currently amended) The method of claim 42, wherein the molecules are 
peptides. 



1 9. (Withdrawn) The method of claim 1 , wherein the molecules are glycoproti 



;ems. 



20. (Withdrawn) The method of claim 1, wherein the molecules are complex 
sugars. 

21. (Currently amended) The method of claim +2, wherein the molecules are 
immunoglobulins. 

22. (Currently amended) The method of claim 4-2, wherein the molecules are ' 
chemicals. 

23. (Currently amended) The method of claim 42, wherein said database is the 
Swissprot database. 

24. (Canceled) 

25. (Canceled) 

26. (Canceled) 

27. (Currently amended) A system for identifying biological families of 
molecules having si milar biologic function , said system comprising : 

a) a storage device containing a first database identifying 
molecules and their corresponding hierarchy; 

b) a processor connected to said storage device and in 
communication with a second database containing molecule 
sequences; 
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an input device connected to said processor for inputting a selected 
molecule to be analyzed, wherein said processor is programmed te for 
automatically classify classifying molecules having similar biologic function, 
said classifying comprising the steps of: 

a) automatically creating a hierarchi cal organization of the molecules in 
database comprising the steps of: 

i calculating pairwise similaritie s between said molecules in saiH 
database by combi ning at least one standard meas ure of similarity, resulting in a 
first set of expectation values of similarity; 

ii analyzing said first set of exp ectation values of similarity so as tn 
obtain a second se t of expectation values of similarity, where in said molecules of 
the second set have a high degree nf similarity; 

iii merging said resulting molecu les of step ii ) so as tn form clusters. 
wherein only mol ecu les w ith an expectation value helnw a first resfrirteH 
threshold are merged: 

iv identifying groups of clusters u sing focal consideration resulting in 
related clusters: 

v determining the relationship b etween related dusters in said g roups: 

vi analyzing said groups of the relat e d clusters of step vV thereby 
creating a hierarchical organization of said molecules; 

b) automatically determining the posit i on of a selec ted molecule in sain 
database based on the hierarchical organisatio n of step a. com prising the step* of- 

i identifying pairwise similarities b etween said selected molecule to the 
molecules in said database h v combining at least one standard measure of 
similarity, resulting in a third set of expectation v alues of similarity; 
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ii identifying g eometric averaging of said selected molecule to each of 
said result ing clusters in said hierarchy, resulting in a forth set of expectation 
values of similarity : 

iii identifying related clusters from said hierarchy of step a, having a 
geometric averaging with said selected molecule below a second threshold, 
thereby classi fying molecules having similar biologic function, 

thereby identifying biological families nf molecules having similar 
biologic function . 

28. (New) An article comprising a storage medium having stored therein 

instructions that, when executed by a processor, result in: 

r 

classifying molecules having similar biologic function, said classifying 
comprising the steps of: 

a) automatically creating a hierarchical organization of the molecules in a 
database 'comprising the steps of: 

i calculating pairwise similarities between said molecules in said 
database by combining at least one standard measure of similarity, resulting in a 
first set of expectation values of similarity; 

ii analyzing said first set of expectation values of similarity so as to 
obtain a second set of expectation values of similarity, wherein said molecules of 
the second set have a high degree of similarity, 

iii merging said resulting molecules of step ii) so as to form clusters, 
wherein only molecules with an expectation value below a first restricted 
threshold are merged; 

iv identifying groups of clusters using focal consideration resulting in 
related clusters; 
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v determining the relationship between related dusters in said groups; 

vi analyzing said groups of the related clusters of step v), thereby 
creating a hierarchical organization of said molecules; 

b) automatically determining the position of a selected molecule in said 
database based on the hierarchical organization of step a, comprising the steps of: 

i identifying pairwise similarities between said selected molecule to the 
molecules in said database by combining at least one standard measure of 
similarity, resulting in a third set of expectation values of similarity; 

ii identifying geometric averaging of said selected molecule to each of 
said resulting clusters in said hierarchy, resulting in a forth set of expectation 
values of similarity; 

iii identifying related clusters from said hierarchy of step a, having a 
geometric averaging with said selected molecule below a second threshold, 
thereby classifying molecules having similar biologic function. 



